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i  ABSTRACT 

This  paper  deals  with  a  classification  problem  based  on  ranking  and  selection  ap¬ 
proach.  We  assume  that  the  populations  follow  multivariate  normal  distribution.  The 
corresponding  selection  problem  is  to  choose  the  population  with  the  smallest  Mahalanobis 
distance.  The  subset  selection  approach  is  considered  throughout  this  paper.  Sometimes 
l  the  indifference  zone  approach  is  also  proposed.  It  should  be  pointed  out  that,  for  the 

subset  selection  approach,  we  need  not  assume  that  the  individual  to  be  classified  belongs 
to  one  of  the  several  given  categories.  The  classification  procedures  depend  on  whether 
the  parameters  £  and  S,  are  known  or  unknown. 

I 
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1.  Introduction 


The  problem  of  classification  arises  when  an  investigator  makes  a  number  of  measure¬ 
ments  on  an  individual  and  wishes  to  classify  the  individual  into  one  of  several  categories 
on  the  basis  of  these  measurements.  Since  Fisher  (1936)  introduced  the  linear  discriminant 
function  for  distinguishing  between  two  multivariate  normal  distributions  with  a  common 
covariance  matrix,  a  great  deal  of  work  has  been  done  by  many  authors  on  this  prob¬ 
lem.  For  in  extensive  bibliography,  the  reader  is  referred  to  Anderson,  Das  Gupta  and 
Styan  (1972),  Das  Gupta  (1973)  and  Lachenbruch  (1975).  For  a  general  approach  to  this 
problem,  Anderson  (1984)  is  a  good  reference. 

In  general,  for  the  classification  problem,  we  assume  that  the  individual  to  be  classified 
belongs  to  one  of  the  several  categories.  In  a  real  situation,  this  assumption  may  not  be 
appropriate.  Thus  the  problem  of  selecting  the  nearest  category  to  the  individual  based  on 
distance  function  was  considered  to  cover  the  above  drawback.  For  the  decision  theoretic 
approach  see  Cacoullos  (1965a,  1965b)  and  Srivastava  (1967).  Although  some  intuitive 
classification  procedures  have  optimality  properties  based  on  decision  viewpoint,  in  practice 
we  want  to  control  the  probability  of  misclassification.  Using  the  classical  approach,  it  is 
difficult  to  control  this  probability.  Hence  an  approach  based  on  the  concept  of  ranking  and 
selection  was  considered  by  Cacoullos  (1973)  and  A.K.  Gupta  and  Govindarajulu  (1973, 
1985).  Unfortunately,  their  results  are  too  conservative  and  very  lir.ited. 

Let  CC  stand  for  a  correct  classification  and  R  denote  a  classification  procedure,  for 
a  given  constant  P* ,  1/k  <  P*  <  1,  we  want  to  choose  a  classification  procedure  R  to 
satisfy  the  probability  requirement  (1.1) 

P{CC\R)  >  P*  (1.1) 

where  P(CC\R)  is  the  probability  of  a  correct  classification  when  the  procedure  R  is  used. 
To  make  the  classification  problem  more  precise,  we  may  ask  the  problem:  can  one  find  a 
classification  procedure  satisfying  the  probability  requirement  (1.1)  and  what  is  the  sample 
size  needed? 

Let  7T(,  t  =  0, 1, . . . ,  k,  be  k  +  1  populations,  we  want  to  classify  7Tq  as  one  of  the  7r,-,  t  = 
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1 , . . . ,  jfc.  We  assume  that  7r*  ~  Np  \^p.,  E,  J  (the  p-variate  multivariate  normal  distribution 
with  mean  vector  p.  and  covariance  matrix  E ,),  i  =  0, 1, . . . ,  k.  Based  on  the  Mahalanobis 
distance  between  two  populations,  our  problem  is  related  to  the  problem  of  selecting  the 
smallest  Mahalanobis  distance.  The  subset  selection  approach  is  considered  throughout 
this  paper.  Sometimes  the  indifference  zone  approach  is  also  used.  The  classification 
procedures  depend  on  whether  the  parameters  p.  and  E,  are  known  or  unknown. 

2.  Classification  procedures  when  u  known,  p.,  *  =  1, . . . ,  k,  unknown 

_  _  _  — U  -  *  -  - 


In  this  section  we  assume  that  p^  is  known  and  p.,  i  =  1 ,. . .  ,fc,  are  unknown.  The 
Mahalanobis  distance  of  tt,-  and  no  is  defined  to  be  9i  =  (^p.  —  p^j  E"1  (p^  ~  tto)  '  *  = 

l,...,k.  Let  <  ...  <  0[fc]  denote  the  ordered  values  of  the  0,’s,  t  =  1  Our 

classification  problem  may  reduce  to  a  selection  problem  which  selects  the  population 
corresponding  to  the  parameter  We  will  classify  no  as  the  selected  population  when 
the  indifference  zone  approach  is  used,  and  classify  no  as  any  one  population  in  the  selected 
subset  when  the  subset  selection  approach  is  used. 


Let  2Lij,  j  —  1, ...  ,n,  be  a  random  sample  from  population  7r,,  Xj  ~  „  52  %-ij  Le 

;  =  1 

n  _  _  t 

the  sample  mean  vector,  St  =  Yh  ( K.ij  ~  K.% )  (2C,y  —  Ki)  be  the  sample  covariance 

j'=i 

k 

matrix  for  the  population  tt,-,  i  =  l,...,fc,  respectively  and  5  =  ^  5,-  be  the  pooled 

«=i 

sample  covariance  matrix  of  populations  ni,...,nk.  Throughout  this  paper,  we  denote 
Xp-s  3s  the  noncentral  chi-square  distribution  with  degrees  of  freedom  p  and  noncentrality 
parameter  6  and  FPtq;s  as  the  noncentral  .F-distribution  with  degrees  of  freedom  p  and  q 
and  noncentrality  parameter  6.  Also,  let  Gp(x;6)  denote  the  cdf  of  xp-s  and  FPiq(x;6)  the 
cdf  of  FPiq-g.  When  6  —  0  we  simplify  these  by  Xp,  FPtq,  Gp( x)  and  Fp>q(x)  respectively. 


We  will  discuss  the  classification  procedures  in  various  situations. 

2.1.  E»,  *  =  1, ..., k,  known 

2.1.1.  Indifference  Zone  Approach 


For  the  indifference  zone  approach,  we  will  make  the  assumptions  0[!|  =  0,  i.e.  no 
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belongs  to  one  of  the  7r,,  i  =  1, . . . ,  k,  and  0[2 ]  >  A,  where  A  is  a  given  positive  constant. 
Let  Yi  =  n  (Xi  ~  tio)  E,r  1  (Z,  ~  M0)  ,  t  =  1,  — ,  k,  then  F,  ~  x •  Intuitively,  we  will 
consider  the  classification  procedure  Ri  as  follows: 

Rx:  Classify  -kq  as  7ry  if  and  only  if  F,  =  min  Fy. 

i<y<k 

For  a  given  P*,  we  want  to  find  an  appropriate  sample  size  n  so  that  the  probability 
requirement  (1.1)  is  satisfied.  The  following  theorem  is  useful  for  this  problem. 

Theorem  2.1.  inf  P(CC|Pi)  =  /0°°[1  -  Gp{x-,n£Y)}k~'dGp{x).  (2.1) 

Proof.  Let  Yu\  denote  the  statistic  corresponding  to  the  parameter  0(,j.  Then 

P(CC\Ri)  —  P  {F(1)  <  F(y),  j  =  2,...,k} 


f°°  Jl 

=  /  n  i1  ~  °p  (z; n0ui)i  dGp{x) 

j  =2 

fOO 

>  /  [1  -  Gp(x;nA)]fc-1dGp(x) 

Jo 


The  inequality  (2.2)  holds,  since  Gp(x\6)  has  the  stochastic  increasing  property  and  the 
equality  is  attained  when  0[2]  =  . . .  =  0[/t]  =  A.  □ 

Remark  2.1.  Let  d,  be  the  solution  of  the  equation 


/  [l  —  Gp(x]d)]k~1dGp[x)  =  P* 
Jo 


and  n*  be  the  smallest  positive  integer  such  that  nA  >  d.  Then  n*  is  the  sample  size 
needed  to  guarantee  the  probability  requirement  (1.1). 

2.1.2.  Subset  Selection  Approach 

For  the  subset  selection  approach,  Gupta  (1966)  and  Gupta  and  Studden  (1970)  have 
considered  the  problem  for  selecting  the  smallest  parameter  of  i  =  1 

Following  their  idea,  we  consider  the  classification  procedure  P2  as  follows: 

P2:  Classify  n0  as  any  one  of  the  7r,’s  for  which  F,  <  c2  min  Fy,  where  c2  >  1  is  the 

1  <j<k 

smallest  value  such  that  the  probability  requirement  (1.1)  is  satisfied. 

By  applying  Theorem  3.1  of  Gupta  and  Studden  (1970),  we  have  the  following  theo- 
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Theorem  2.2.  inf  P{CC\R2)  =  J0°°[l  -  Gp(x/ c2)]k-1dGp{x). 
Proof.  P(CC\R2)=P{Y{1)<c2y{j),  j  =  2,...,fc} 


(2.4) 


roo  J* 

=  /  n  i1  ~ (z/c2;nd[j])i (x;n0[i]) 

J°  j= 2 

>/  [l  -  Gp  (x/c2;n^[i])]fc_1  dGp  (x;n0[!])  (2.5) 

./o 

roo 

>  /  [l-Gp(x/c2)!fc-1dGp(x).  (2.6) 

Jo 

The  equality  in  (2.5)  holds  when  0(i]  =  0(2)  =  . . .  =  0[*j  and  the  equality  in  (2.6)  holds 
when  0[!]  =0.  □ 

Remark  2.2.  The  values  c2  satisfying  the  equation  /0°°[l  —  Gp(x/c2)]fc-1dGp(x)  =  P*  can 
be  found  from  Gupta  and  Sobel  (1962)  and  Armitage  and  Krishnaiah  (1964). 

On  the  other  hand,  we  may  consider  another  classification  procedure  Rz  as  follows: 

Rz:  Classify  tt0  as  any  one  of  the  7r,’s  for  which  Y,  <  c 3,  where  C3  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

For  the  determination  of  the  value  C3,  we  can  use  the  following  theorem: 

Theorem  2.3.  inf  P{CC\Rz)  =  Gp(c3),  if  =  0. 

Proof.  P{CC\Rz)  =  P  {Y(i)  <  c3} 

=  Gp(c3)  if  0[i]  =0.  □ 


The  values  of  C3  can  be  found  from  xp~tables. 

Remark  2.3.  Note  that  if  we  use  the  procedure  Rz,  the  selected  subset  may  be  an  empty 
set.  This  drawback  suggests  that  7r0  may  not  belong  to  one  of  the  n t  =  1,...  ,k. 

Remark  2.4.  We  may  consider  a  testing  problem:  Ho- ^  for  some  t,  »  =  1 

vs.  ^  fi.,  i  =  1  For  a  level  a  test,  the  suggested  rejection  region  could  be 

min  Yj  >  d,  where  d  is  the  C3  value  determined  by  the  procedure  R3  and  the  associated 
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probability  P*  is  1  —  a.  We  note  that 

Ph°  Y>^d)  =  i-  p«-  (,§&  < d) 

<l~PHo  (V(i)  <  d)  =  1  -  P*  =  a. 

2.2.  t  =  unknown,  not  all  equal 

2.2.1.  Indifference  Zone  Approach 

If  £*,  t  =  1, . . . ,  k,  are  unknown  but  not  all  equal,  we  may  estimate  0,  by  (x_i  ~  tio)  1 

(Kx  ~  Mq)  ,  *  =  1, •  •  • , k.  Let  Y*  =  n  (x,-  -  /z^  S~}  (x,-  -  ,  t  =  and  define 

a  classification  procedure  R4  as  follows: 

R4:  Classify  tt0  as  7r.  if  and  only  if  Y*  =  min  Y* . 

i<i<fc  J 

We  note  that  Y*  ~  *=^FP 

,n— p;n^i  and  FPin-p{x\n6i)  has  the  stochastic  increasing 
property.  If  we  assume  that  0[!]  =  0  and  d^\  >  A,  then  we  have  the  following  theorem: 

Theorem  2.4.  inf  P{CC\R4)  =  /“[l  -  .FPjn_p(x;  nA)]fc-1<LFPiri_p(x).  (2.7) 

Proof.  P(CC\R<)  =  p  <  tSEfcYfo,  j  =  2,...,*} 

r  o°  fc 

=  /  [l  —  Fp,n-p  (a:; n0[j] ) ]  dFPin_p(x) 

j= 2 

poo 

>  /  [l-Fp,n_p(x;nA)]fc-1dFpin_p(x).  □ 

Jo 

Remark  2.5.  In  order  to  satisfy  the  probability  requirement  (1.1),  we  should  choose  the 
smallest  n  such  that  nA  >  d  and  /0°°(l  —  FPirv_p(x; d)}k~1dFp>n-p(x)  =  P* . 

2.2.2.  Subset  Selection  Approach 

Analogous  to  Gupta  and  Studden  (1970),  we  consider  the  classification  procedure  R5 
as  follows: 

Rs :  Classify  7r0  as  any  one  of  the  7r,’s  for  which  Y-*  <  C5  min  Y,*,  where  C5  >  1  is  the 

i<y<fc  1 

smallest  constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

By  applying  Theorem  3.1  of  Gupta  and  Studden  (1970),  we  have  the  following  theo¬ 
rem: 
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(2.8) 


Theorem  2.5.  inf  F(CC|i?s)  =  /0°°jl  -  Pp,n-p(z/c5)]fc  '(ffp, n_p(x). 

Proof.  P(CC\RS)  =  P  {jSfcl?,,  5  ‘■p55?1or  i  =  2 . *} 

/oo  j_ 

JJ  [l  —  Fpin_p  (x/cs; dFPtn-p  (x;n0[i|) 

1  y=2 

/*°o 

—  /  [l  —  ^p,n— p  (^/cs;  dFptn_p  (x;n#[jj) 

•/  o 
f°° 

—  I  [l  —  •^p.'»-p(a:/c5)]  ^-Fp.n-pl1)*  Q 

JO 

Remark  2.6.  The  value  of  c$  is  the  solution  of  the  equation 

POO 

I  [l  —  -fpin-p(z/c5)]fc  ldFpin~p(x)  =  P  . 

Jo 

On  the  other  hand,  we  may  consider  a  classification  procedure  Re  as  follows: 

R&:  Classify  7r0  as  any  one  of  the  7Ti’s  for  which  Y*  <  c6,  where  c6  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

For  the  determination  of  the  c&  values,  it  is  easy  to  show  that 

Theorem  2.6.  inf  P(CC\Re)  =  -Fp.n-p  if  0[i]  =  0. 

The  values  of  ce  can  be  found  from  the  Fp  n_p-tables. 

Remark  2.7.  Note  that  if  we  use  the  procedure  Re,  the  selected  subset  may  be  an  empty 
set. 

Remark  2.8.  We  may  consider  a  testing  problem:  for  some  i,  i  —  1, . . .  ,k, 

vs.  ^  fx.,  i  =  l,...,k.  For  a  level  a  test,  the  suggested  rejection  region  could  be 

min  Y,*  >  d,  where  d  is  the  cq  value  determined  by  the  procedure  Re  and  the  associated 

i<j<k  3 

probability  P*  is  1  —  a. 

2.3.  E »  =  £,  *  =  1, . . . ,  k,  unknown 

When  Ei  =  E,  i  =  1  are  unknown,  we  estimate  E  by  S  and  let  Y**  = 

n(X  -^o)  s~'  ,  i  =  l,...,k.  We  note  that  Y{*  ~  v_pp+{^p,v-P+i;niy. , 
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where  v  =  k(n  -  1)  and  a'Y,~la/Chl{SY,~1)  >  a'S~la  >  a'E~la/Chp(S'£~1)  for  any 
p  x  1  vector  a,  where  C/it(S£_1)  is  the  t-th  smallest  eigenvalue  of  the  matrix  SE_1.  Let 
Z  =  C /ii(S£-1) / C hp(5E_1)  and  H(z)  be  the  cdf  of  Z  ( H  is  independent  of  i  = 
1, ...  ,1b,  and  E).  This  problem  was  considered  by  Chattopadhyay  (1981)  for  subset  selec¬ 
tion  approach. 

2.3.1.  Indifference  Zone  Approach 

For  the  indifference  zone  approach,  we  consider  the  classification  procedure  Rj  as 
follows: 


R7 :  Classify  tro  as  7 r;  if  and  only  if  Y-m*  =  min  Y* * . 

i</<*  ; 

For  this  procedure,  the  following  result  can  be  used  to  determine  the  required  sample 
size  to  satisfy  the  probability  requirement  (l.l). 

Theorem  2.7.  inf  P{CC\R7)  >  fQl  /0°°[l  -  Gp(x/z;  nA)]k-ldGp{x)dH{z).  (2.9) 

Proof.  P(CC\R7)  =  p  {y(V)  <  r(*),  y  =  2 . a?} 

>  p{n(Z(1)  -ajV*  (Z(1)  -  fio)  <  igh}"(Z<„  -&)V'  (Iw  -£o) , 
j  =  2,. . .  ,k 


nOO  ^ 

IJ  [!  -  Gp  (x/z;n0[y))]  dGp(x)cftf(z) 
1  3= 2 

noo 

[1  -  Gp(x/2;nA)]fc-1dGp(x)d//(2). 

1 


□ 


Remark  2.9.  Since  C/ij(S£  1)/G/ip(5E  x)— >1  as  n  ->  oo,  we  have  P(CC|R7)  -+  1  as 
n  —*  oo. 


Remark  2.10.  The  distribution  of  C/ii(SE  l)/Chp(S E  x)  can  be  found  in  Pillai,  Al-Ani 
and  Jouris  (1969). 

Remark  2.11.  It  is  easy  to  show  that  inf  P(CC|R7)  >  P(Z  >  0e)  J^°[l  —  Gp(x/6c:\nA)\k~xdGp{x), 
where  e  >  0  and  P(Z  <  6e )  =  e.  The  equation  P{Z  >  9e)  /o°°[l-Gp(x/0e;  nA)]fc-1dGp(x)  = 
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P*  can  be  used  to  determine  the  required  sample  size. 


2.3.2.  Subset  Selection  Approach 

For  the  subset  selection  approach  we  refer  to  Chattopadhyay  (1981).  We  consider  the 
classification  procedure  Rg  as  follows: 

Rg:  Classify  no  as  any  one  of  the  7rt’s  for  which  Y*'  <  eg  min  Y" ' ,  where  eg  >  1  is  the 

i <y<fc  ] 

smallest  constant  such  that  the  probability  requirement  (l.l)  is  satisfied. 

Analogous  to  the  proof  of  Theorem  2.7,  we  have  the  following  result: 

Theorem  2.8.  inf  P{CC\R8)  >  fQ 1  /0°°[1  -  Gp(x/esz)\k-ldGp(x)dH{z).  (2.10) 

Remark  2.12.  /0°°[1  -  Gp(x/cgz)\k~ldGp(x)dH (z)  — >  1  as  n  — ►  oo. 

Remark  2.13.  It  is  easy  to  show  that 

f°° 

inf  P(CC\Rg)  >  P(Z  >  0e)  I  \l-Gp{x/cs9e)\k-ldGp{x). 

Jo 

On  the  other  hand,  we  may  consider  an  easier  classification  procedure  Rg  defined  as 
follows: 


Rg:  Classify  no  as  any  one  of  the  7T,’s  for  which  Y%"  <  eg,  where  eg  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 


It  is  easy  to  show  that 

Theorem  2.9.  inf  P{CC\Rg)  =  FPlV-p+i  (^^-cg)  if  0(1]  =0. 

Remark  2.14.  We  can  consider  the  testing  problem:  i/0:p0  =  f±.,  for  some  i,  i 
vs.  H  i:Mq  7^  f  =  l,...,fc.  The  reject  region  is  min^  Y**  >  d. 

3.  Classification  procedures  when  unknown,  p.,  i  =  1, . . . ,  k,  known 


1 ,  .  .  .  ,  A.  , 


In  the  case  that  ^  is  unknown  and  i =  1, . . .  ,k,  are  known.  Let  X01,  •  •  •  i^on  be  a 

random  sample  from  n0  and  X 0  =  y,  S0  =  £  (X0;  -  X0)  (X0j  -  Xq)'.  The 

;=i  ;= i 

Mahalanobis  distance  between  populations  no  and  n<  is  defined  to  be  A,  =  Hq  1 

(fi.  -  pV  We  will  discuss  the  classification  procedures  in  various  situations. 
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3.1.  £o  known 


3.1.1.  Indifference  Zone  Approach 


For  the  indifference  zone  approach,  we  assume  that  A(2j  —  A[jj  >  A.  We  define  Z,  — 

Sq  1  —  Mf)  i  t  =  1  Intuitively,  we  may  consider  the  classification 

procedure  Rio  as  follows: 

i?io:  Classify  no  as  tt,  if  and  only  if  Z,  =  min^  Zj. 

For  the  procedure  R io,  we  note  that 

Z,  <  Zy,  j  —  1, . . . ,  k,  j  /  i  iff  2  Eq  1  ( X0  —  <  Ay  —  A,-. 

Thus  we  have  the  following  result: 

Theorem  3.1.  inf  P(CC |/210)  >  1  -  (Jfc  -  1)$  (-^),  (3.1) 

where  $  is  the  cdf  of  the  standard  normal  and  <5 2  =  max  62,  62  ~  max  ( a  .  —  u\  Sn  1  (u  u 

Ki<k  ’  *  »<><*  \-J  -»/  0  \-J 

i*i 

Proof.  P(CC\Rio)  =  P  {Z(1)  <  Z(yj,  j  =  2,...,k} 

=  P  |2  _^(i))  So  1  (*o  ~  Ho)  -  ^U1  ”  ^(il»  ]  =  2,...,fc| 

-  p  |2  {tL(j)  ~  i))  so  1  {Zo  Ho)  <  A,  j  =  2,. . .  ,fc  j 

a 1  -Ep{2  tw  (Zo  -&,)  >  a} 


=  i-£>  - - 

’  2  , 2  [(-0)  ~  —  1 1 ; )  Eo' fay)  "S(i)) 


The  inequality  (3.2)  holds  since  62  >  S0  1  ^ 

Remark  3.1.  As  n  — ►  oo,  $  (  A  j  — >  0,  hence  P(CC|Pio)  — ►  1.  For  given  P*,  we  can 


find  n  such  that  inf  P(CC|Pio)  >  P*. 
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3.1.2.  Subset  Selection  Approach 


For  the  subset  selection  approach,  we  consider  a  classification  procedure  Ri  i 


as  follows: 


JRn:  Classify  tto  as  any  one  of  the  7r,’s  for  which  Z<  <  min  Z;-  +cU)  where  cn  is  the 

1 

smallest  positive  constant  such  that  the  probability  requirement  (l.l)  is  satisfied. 


Analogous  to  the  proof  of  Theorem  3.1,  we  have  the  following  result: 

Theorem  3.2.  inf  P{CC\Rn)  >  1  -  (*  -  1)S  (^£il) •  (3.3) 

Proof.  P(CC\Rn)  =  P  [Z{i)  <  ZU)  +  cn,  j  =  2, . . . ,k} 


~  P  |2  (i£(y)  f£(  1))  ^o1  (Ko  A£q)  —  ^b'l  ^[l]  +  cu,  J  -  2,. . .  ,fcj 

—  P  | ^  ~  (^o  ~  Hq)  —  c 1 1 »  J  =  2, . . . , 


On  the  other  hand,  we  suggest  another  classification  procedure  i?i2  as  follows: 

#12:  Classify  7r0  as  any  one  of  the  7Tj’s  for  which  nZ{  <  ci2,  where  ci2  is  the  smallest 
positive  constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

For  this  procedure,  it  is  easy  to  show  that 

Theorem  3.3.  inf  P(CC\Ri2)  —  Gp(ci2)  if  A[i]  =  0. 

Remark  3.2.  We  may  consider  a  testing  problem:  =  A£,,  for  some  i,  i  =  l,...,fc, 

vs.  /j,.,  *  =  1 ,k.  The  suggested  reject  region  is  min  Zj  >  d. 

3.2.  So  unknown 

When  So  is  also  v  town,  we  estimate  it  by  So,  and  let  Z*  =  n  ^ XQ  -  S^1  ^Xo  -  fi.  j , 

i  =  1, . . .  ,k.  For  the  rence  zone  approach,  we  consider  the  classification  procedure 

Ri 3  as  follows: 

R13:  Classify  wo  as  7r,  if  and  only  if  Z,*  =  min  Z.*. 

i<j<fc  } 
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Analogous  to  the  proof  of  Theorem  2.7,  and  Theorem  3.1,  for  large  sample,  we  have 
the  following  result: 

Theorem  3.4.  inf  P(CC\Ri3)  >  1  —  (A;  —  1)$  j ,  if  «  is  large  enough. 

Proof.  P{CC\R13)  =  P  {z(*1}  <  Z(*y),  j  =  2,...,fc} 


>  P 


< 


Ch1{So^)n 

Chp{S0Xol)n 


(Xo-S,,)),i  =  2,. 

53  -P  |n  (^0  —  /f (!))  ^0  1 


(Zo  -i£(1))  ^  n  (^0 

,  if  n  is  large  enough. 


For  the  subset  selection  approach,  we  consider  the  classification  procedure  #14  as 
follows: 

#u :  Classify  7T0  as  one  of  the  7rt’s  for  which  Zt*  <  min  Z*  -f  Ci4,  where  cJ4  is  the  smallest 

1  <  j  <  k  3 

positive  constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

For  large  sample,  it  is  easy  to  show  that 

Theorem  3.5.  inf  P(CC\Ru)  >  1  —  (k  —  l)$  ( ~V2gCM )• 

_  On  the  other  hand,  a  simple  procedure  #15  can  be  defined  as  follows: 

#15:  Classify  7To  as  one  of  the  7r,-’s  for  which  Z,*  <  C15,  where  C15  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

Since  Z,‘  ~  ^Fp,n.r  it  is  easy  to  show  that 

Theorem  3.6.  inf  P(CC\Ri5)  =  Fp>n_p  ((n-i$pc™)  if  A^j  =  0. 

4.  Classification  procedures  when  t  =  0, 1, . , . ,  fc,  unknown 

When  £  ,  i  =  0, 1, . . . ,  k,  are  unknown,  we  use  0j  defined  in  Section  2asa  measurement 
of  distance  between  populations  7 r0  and  7r,-,  *  =  1, . . . ,  k,  respectively.  Let  2£,;-,  j  =  1, . . . ,  n, 
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r» 

be  a  random  sample  from  population  7r» ,  i  =  0, 1, . . . ,  k,  X{  —  £  Y  Kij  be  the  sample 

;  =  i 

n  _  _  f 

mean  vector  and  S,  =  ^y  Y  (2£,;  -  2Ci)  (X,y  —  2£t)  be  the  sample  covariance  matrix 

j=i 

k 

within  the  population  tt,  and  S*  =  ^^y  ^  5,  is  the  pooled  sample  covariance  matrix.  We 

i=0 

will  discuss  the  classification  procedures  in  various  situations. 


4.1.  i  =  1, . . . ,  k,  known 

When  t  =  l,...,fc,  are  known.  We  define  U<  =  n  (Xo  —  iQ/S”1  (X0  —  Xj) , 
i  =  1, . . . ,  k.  For  this  case,  we  use  the  subset  selection  approach  and  consider  a  classification 
procedure  Pi6  as  follows: 

Rl6:  Classify  7To  as  one  of  the  7r,’s  for  which  Uy  <  ci6  min  Uy,  where  cie  >  1  is  the 

1  <j  <  k 

smallest  constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 


For  this  procedure,  we  have  the  following  result: 

Theorem  4.1.  inf  P{CC\Rie)  >  /0°°[1  -  Gp(2i/c16)]fc-1dGp(x).  (4.1) 

Proof.  Let  Uyj  =  n  (Xq  -  2C(y))  L(j)  (&o  ~  2C(y))  ’  =  lf  •  •  •  »*•  Given  %o  =  2,  Uyj,  J  = 
are  independent  and  U(,)  ~  x2  /  \  <  /  \  • 

«"(*«>-*)  sr/,(^r*) 

P(CC|J?i6)  =  P  {u(i)  <  ci6U(j),  j  =  2, . . .  ,&} 

=  /  P  {”  (~ (1)  ~~  “)  E(_1)  (— (1)  _  -  Cl6n  (^0)  ~  *)  (*(/)  -  2)  »  J  =  2,  •  •  • ,  *  j  dF(x) 

=  /  n  [X  -  Gp(y/Cie; n  (m0)  -  x)  (mU)  -  x)]  dGp  (y; n  (m(1)  -  x)  (m(1)  -  x) )  dP(x) 

>  J  nt1  ~  Gp(y/ciQ)WGP  (y; «(/£(i)  -2)  (m(1)  -  x)^  <if{x) 

=  p{u(1)  <c16Ufy),  j  =  2,...,fc} 

where  P  is  the  cdf  of  X0.  UW.  and  U(j)  are  independent  and  U(»  ~  *p- 

Now  U(X)  ~  2xp-  Hence 

P(CC|P16)  >  /  °°[1  -  Gp(2y/c16)]fc-1dGp(y).  □ 
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Remark  4.1.  If  we  use  the  measurement  V{  =  ^  -  £0j  (Eo  -f  E,)  1  .  Then  we 

have  an  easier  classification  procedure  Rn  defined  by 

Rn\  Classify  7ro  as  one  of  the  for  which  Uj  <  cn ,  where  U*  =  n  CX0  ~  X,)  (£o  + 
Ei)_1(2o  ~Ki)  and  c  17  is  the  smallest  positive  constant  such  that  the  probability  re¬ 
quirement  (1.1)  is  satisfied. 

It  is  easy  to  show  that  inf  P(CC\Rn)  =  Gp(ci7)  if  t/[i]  =  0. 

4.2.  E,-,  *  =  1, . . . ,  k,  unknown,  not  all  equal 


When  E^,  t  =  1, . . . ,  k,  are  unknown  and  Et,  *  =  1, . . . ,  k,  are  not  all  equal.  We  define 
V{  —  n  {XQ  -  Xi)'  S~l  (AT0  —  Xi)  i  *  =  1,  •  •  • , k.  A  classification  procedure  Rl8  is  defined 
as  follows: 


Ris :  Classify  iro  as  one  of  the  7T,’s  for  which  V,-  <  cis  min  V, ,  where  c18  >  1  is  the  smallest 

1  <]'<k 

constant  such  that  the  probability  requirement  (l.l)  is  satisfied. 

Given  Xn  =  x,  we  have  V,  ~  F  /  y_ /  \ .  Analogous  to  the  proof 

— 0  n~P  p,rt-p;n  (m,. -*}£,.  1  (£-  2.) 

of  Theorem  4.1,  we  have  the  following  result: 


Theorem  4.2.  inf  P{CC\Rl8)  >  /0°°[1  -  Fp,n-p{2y/clti)}k-ldFp<n_p{y). 
Proof.  /P{F(1)  <  c18V(  y),  j  =  2,...,k|Xo  =  x}  dF(x) 


(4.2) 


-  P  {^(i)  -  Cl8^(i)’  J  =  2,...,k} 

where  Vgj  and  K(1)  are  independent,  P(1)  ~  ^~2£fp>n_p  and  Vfo  ~  FP|ft-p,  j  = 

2, . . . ,  k.  Thus 


P(CC\Rx8)  > 


rOO 

J  [I  —  ^P,n-p(2y/c18)]*  1dFPln-p(y). 
Jo 


Remark  4.2.  We  can  define  an  easier  classification  procedure  Rig  as  follows: 


Rig:  Classify  7r0  as  one  of  the  7r,’s  for  which  V{  <  cig,  where  cig  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (l.l)  is  satisfied. 

It  is  easy  to  show  that  inf  R(CC|Rig)  =  Fp>n-P  (  2(n-'i)pCl9) 1  ^  ^[i]  = 


14 


4.3.  £,■  =  £,  »  =  0, 1, . . . ,  k,  unknown 

When  E,  =  E,  i  =  0, and  E  is  unknown,  we  estimate  E  by  5*  and  define 
=  n  (X0  —  X,•),  5*_1  fX0  —  2Li)  !  *  =  1 Then  a  classification  R20  can  be  defined 
by: 


R20:  Classify  tt0  as  any  one  of  the  7Tj’s  for  which  Wi  <  C20  min  Wj,  where  C20  >  1  is  the 

l<j<k 

smallest  positive  constant  such  that  the  probability  requirement  (1.1)  is  satisfied. 

Given  Z„  =  x,  W,  ~  - ,  where  »'  =  (*+  l)(n  -  1). 
Analogous  to  the  proof  of  Theorem  4,1  and  Theorem  2.7,  we  have  the  following  result: 


Theorem  4,3.  inf  P(CC|-R20)  >  f*  /0°°[l  -  FPtV- _p+i(2y/ C2oz)} 


]k-i 


dFp,v.-p+i(y)dH'(z). 


(4.3) 


where  H*(z )  is  the  cdf  of  C/i1(5*E-1)/Chp(5*E-1). 

Proof.  P{W(1)  <  c2oW(j),  j  =  2,...,fc|Xo  =  x} 

=  P  |n  (Z(1)  -  x)  V"1  (x(1)  -  x)  <  c20n  (x(y)  -  x)  V"1  (x(;)  -  x)  ,  j  =  2, . . . ,  *  j 

>  P  {n  (x(1)  -  *)  V1  (x(1)  -  x)  <  Cchh^~[)c2on  (Z(y)  -^V1  (Zyj  -  x)  ,  ;  =  2,. 

=  Jo  Jo  n  [i  -  Pp,v-p+i  (y/c20z;n  (p(y)  -x)  E"1  (m(/)  -x)^  dFPtV>-p+i 
(y;n(M(1)-x)  E_I  (m(1)  ~  x^  dH*(z) 

>  Jo  J  (1 -Pp,v*-p+i(y/c2ox)]fc_1dPPlv-p+i  ^y;n(if(1) -x)  E"1  (m(1)  -  x)^  dJT*(x) 

=  P{W(1)  <c20W(*y),  j  =  2,...,A|Xo  =  x} 
where  ~  is  independent  of  Xq.  Therefore 


P(CC|P2o)  =  P{W(1)  <c20W(j),  j=2,...,fc} 

/•  1  fOO 

>  /  /  [1  -  Fp)V-_p+l(2y/c20^)]fc_1dPp(v--p+x(y)^P'*(x)-  □ 

Jo  Jo 
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Remark  4.3.  We  also  have  inf  P(CC|i?2o)  >  P{Z*  >  de) 

r  oo 

/  [1  ~  Fp,vm—p+i  (2y/c2o^e)]  dFp'V-—  p+ 1  (y) > 

Jo 

where  Z *  =  C^i(5*E-1)/C/ip(5*E-1). 

Remark  4.4.  Since  W,  ~  2v.^p+1  Fp,v--P+ we  can  define  an  easier  classification 
procedure  Rn  as  follows: 

J?2i*  Classify  7To  as  one  of  the  ^’s  for  which  W,  <  C21,  where  C21  is  the  smallest  positive 
constant  such  that  the  probability  requirement  (l.l)  is  satisfied. 

It  is  easy  to  show  that  inf  P(CC|P2i)  =  FPiV •  -P+ 1  (S“-'p  1,C21)  ii9W  =0- 
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'zone  approach  is  also  proposed.  It  should  be  pointed  out  that,  for  the  subset  selection 
■approach,  we  need  not  assume  that  the  individual  to  be  classified  belongs  to  one  of  the 
[several  given  categories.  -<^he  classification  procedures  depend  on  whether  the  parameters 
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